Improved speech recognition using a subspace projection approach

نویسندگان

  • Philipos C. Loizou
  • Andreas Spanias
چکیده

Two class-separability criteria based on the divergence measure are proposed to improve speech recognition performance. The average and weighted average divergence measures are used as criteria for nding a transformation matrix which maps the original features into a more discriminative subspace. Results are presented for a highly confusable task. Address correspondence to: Philipos C. Loizou Department of Applied Science Univ. of Arkansas at Little Rock Little Rock, AR 72204-1099 Phone:(501) 569-8067 Fax:(501) 569-8020 E-mail: [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition

Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...

متن کامل

Orthogonal Projection Pursuit Using Genetic Optimization

Projection pursuit is concerned with finding interesting lowdimensional subspace of multivariate data. In this paper we proposed a genetic optimization approach to find the globally optimal orthogonal subspace given training data and user defined criterion on what subspaces are interesting. We then applied this approach to human face recognition. Suppose face recognition is done by simple corre...

متن کامل

Single channel speech enhancement using principal component analysis and MDL subspace selection

We present in this paper a novel subspace approach for single channel speech enhancement and speech recognition in highly noisy environments. Our algorithm is based on principal component analysis and the optimal subspace selection is provided by a minimum description length criterion. This choice overcomes the limitations encountered with other selection criteria, like the overestimation of th...

متن کامل

Human Gait Recognition with 3D Wavelets and Kernel based Subspace Projections

Gait recognition can be regarded as a problem of uniquely representing spatiotemporal surfaces associated with a person’s walking pattern in an efficient manner. In this paper, we describe the approach of using projections of such surfaces onto subspace spanned by appropriate axes using a single framework. Two new algorithms for gait recognition are presented which use projection on subspace of...

متن کامل

Subspace modeling and selection for noisy speech recognition

This paper presents a new subspace modeling and selection approach for noisy speech recognition. In subspace modeling, we develop factor analysis (FA) for representing noisy speech. FA is a data generation model where the common factors are extracted with factor loading matrix and specific factors. We bridge the connection of FA to signal subspace (SS) approach. Interestingly, FA partitions noi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 7  شماره 

صفحات  -

تاریخ انتشار 1999